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TITLE : METHODS OF DETECTING INTERACTIONS BETWEEN 
PROTEINS, PEPTIDES OR LIBRARIES THEREOF USING 
FUSION PROTEINS 

INVENTORS: Andrew D. Hamilton, Indraneel Ghosh, and Lynne Regan 
RELATED APPLICATIONS 

[0001] This application claims the benefit of priority of U.S. Provisional Application 
60/203,712, filed on May 12, 2000. 

FIELD OF THE INVENTION 

[0002] The present invention is related to the reassembly of fusion peptides into a 
functionally active protein complex. Specifically, the present invention provides a 
method of forming peptide complexes that associate through the combination of helical 
domains to form an antiparallel leucine zipper. The present invention is also related to 
the use of assays to investigate protein-protein interactions. The assays of the present 
invention involve the association of fusion proteins comprising GFP fragments and 
heterologous polypeptides into functionally active GFP that exhibits fluorescence. 

BACKGROUND OF THE INVENTION 

[0003] All publications and patent applications herein are incorporated by reference to 
the same extent as if each individual publication or patent application was specifically 
and individually indicated to be incorporated by reference. 

Green Fluorescent Protein 
[0004] Green fluorescent protein (GFP), a relatively small protein comprising 238 
amino acids, is the ultimate source of fluorescent light emission in the jellyfish Aequorea 
victoria. The gene for GFP was first cloned by Prasher et al (1992, Gene, 111 :229-233), 
and cDNA for the protein produces a fluorescent product identical to that of native 
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protein when expressed in prokaryotic (E. coli) and eucaryotic (C. elegans) cells (Chalfie 
et al, 1994, Science, 263, 802-805). / 

[0005] The GFP excitation spectrum shows an absolution band (blue light) maximally 
\\ 395 nm with a minor peak at 470 nm, and an eprfission peak (green light) at 509 nm. 
K s/ The longer-wavelength excitation peak has greater photostability then the shorter peak, 
' but is relatively low in amplitude (Chalfie ft al, 1994, Science, 263:802-805). The 

crystal structure of the protein and of sexferal point mutants has been solved (Ormo et al, 
1996, Science 273, 1392; Yang et alYNature Biotechnol. 14, 1246). The fluorophore, 
consisting of a tripeptide at residujzfs 65-67, is buried inside a relatively rigid beta-can 
10 structure, where it is almost completely protected from solvent access. The GFP 
absorption bands and emission peak arise from an internal p-hydroxybenzylidene- 
imidazolidinone chromoph6re, which is generated by cyclization and oxidation of the 
tripeptide sequence Seryfyr-Gly sequence at residues 65-67 (Cody et aL, 1993, 
Biochemistry 32:1212^1218). 
15 [0006] GFP fluorescence in procaryotic and eucaryotic cells does not require exogenous 
substrates and cofactors. Accordingly, GFP is considered to have tremendous potential in 
methods to monitor gene expression, cell development, or as an in situ tag for fusion 
proteins (Heim et al, 1994, P.N.A.S. USA, 91,12501-12504): Chalfie and Prasher, WO 
95/07463 (March 16, 1995), describe various uses of GFP, including a method of 
20 examining gene expression and protein localization in living cells. Methods are described 
wherein: 1) a DNA molecule is introduced into a cell, said DNA molecule having DNA 
sequence of a particular gene linked to DNA sequence encoding GFP such that the 
regulatory element of the gene will control expression of GFP; 2) the cell is cultured in 
conditions permitting the expression of the fused protein; and 3) detection of expression 
25 of GFP in the cell, thereby indicating the expression of the gene in the cell. Methods such 
as those described by Chalfie and Prasher are advantageous compared to previously 
reported methods which utilized p-galactosidase fusion proteins (Silhavy and Beckwith, 
1985, Microbiol. Rev., 49, 398; Gould and Subramani, 1988, Anal Biochem., 175, 5; 
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Stewart and Williams, 1992, J. Gen. Microbiol, 138,1289) or luciferases, in that the need 
to fix cell preparations and/or add exogenous substrates and cofactors is eliminated. 
[0007] GFP is a valuable marker for intracellular protein localization. However, the 
fusion of GFP with structural proteins can alter their properties, resulting in loss of fusion 

5 protein localization, decreased GFP fluorescence or both. The fluorescence of this 
protein is sensitive to a number of point mutations (Phillips, G. N., 1997, Curr. Opin. 
Struct. Biol 7, 821-27). The fluorescence appears to be a sensitive indication of the 
preservation of the native structure of the protein, since any disruption of the structure 
allowing solvent access to the fluorophoric tripeptide will quench the fluorescence. 

10 Abedi et al (1998, Nucleic Acids Res., 26, 623-30) have inserted peptides between 

residues contained in several GFP loops. Inserts of the short sequence LEEFGS (SEQ ID 
NO: 9) between adjacent residues at 10 internal insertion Sites were tried. Of these, 
inserts at three sites, between residues 157-158, 172-173 and 194-195 gave fluorescence 
of at least 1% of that of wild type GFP. Only inserts between residues 157-158 and 172- 

15 173 had fluorescence of at least 10% of wild type GFP. 

Protein Reassembly using Leucine Zipper 
[0008] The unassisted reconstitution of proteins from peptide fragments has been 
demonstrated for several proteins; including ribonuclease (Richards et al, 1959, J. Biol 

20 Chem. 234, 1459-1465), chymotrypsin inhibitor-2 (Gay et al, 1994, Biochemistry, 33, 
7957 -7963), tRNA synthetases (Shiba et al, 1992, Proc. Natl Acad. Set U.S.A., 89, 
1880-1884), and inteins (Southworth, et al, 1998, EMBO J., 17, 918-926). Protein 
reassembly has thus become an important avenue for understanding enzyme catalysis 
(Richards et al, 1959, J. Biol Chem. 234, 1459-1465), protein folding (Gay et al, 1994, 

25 Biochemistry, 33, 7957 -7963), and protein evolution (Shiba et al, 1992, Proc. Natl. 
Acad. Sci. U.S.A., 89, 1880-1884). Recently, assisted protein reassembly or "fragment 
complementation" has been applied to the in vivo detection of protein-protein interactions 
in such systems as dihydrofolate reductase (DHFR) (Pelletier et al, 1998, Proc. Natl 
Acad. Sci. U.S.A., 95, 12141-12146; Remy et al, 1999, Proc. Natl. Acad. Sci. U.S.A., 96, 
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5394-5399; Pelletier et al, 1999, Nat. Biotechnol, 17, 683-690), ubiquitin (Karimova et 
al, 1998, Proc. Natl. Acad. Sci. U.S.A., 95, 5752-5756; Johnsson et al., 1994, Proc. Natl. 
Acad. Sci. U.S.A., 91, 10340-10344), and p-galactosidase (Rossi et al, 1997, Proc. Natl. 
Acad. Sci. U.S.A., 94, 8405-8410). These reassembly processes are contingent upon the 
proper choice of a dissection site within a protein and can be aided by techniques such as 
limited proteolysis, circular permutation (Baird et al, 1999, Proc. Natl Acad. Sci. U.S.A., 
96, 11241-1 1246; Topell et al, 1999, FEBSLett., 457, 283-289; Zhang et al, 1993, 
Biochemistry, 32, 12311-12318; Regan, L., 1999, Curr. Opin. Struc. Biol, 9, 494- 
499)and loop insertions (Abedi et al, 1998, Nucleic Acid Res., 26, 623-630; Nobuhide et 
al, 1999, FEBSLett., 453, 305-307). . 

[0009] The dissection and subsequent reassembh/of a protein from peptidic fragments 
provides an avenue for controlling its tertiary structure and hence its function. Although 
a majority of leucine zippers associate in a pa/llel fashion, recent examples of both 
naturally occurring and designed antiparall/leucine zippers have appeared in the 
literature (Lupas, A., 1996, Trends Bioch/m. Sc. 21, 375-382; Kohn, W. D. et al, 1997, 
S. J. Biol Chem. 272, 2583-2586; Bryson, J. W. et al, 1995, Science, 270, 935-941; 
Oakley M. G. et al, 1998, Biochemistry, 37, 12603-12610; Oakley, M. G. et al, 1997, 
Biochemistry, 36, 2544-2548). However, the prior art does not disclose the attachment of 
antiparallel leucine zippers to polypeptide fragments to form fusion proteins for 
reassembling the polypeptide fragments into functional proteins. 

[0010] In contrast to paralle/zippers, the antiparallel zippers are oriented in an opposite 
direction. Antiparallel Zippers have the advantage of occurring less frequently in natural 
proteins. Thus, antiparallel leucine zippers will interfere to a lesser extent with natural 
cellular proteins than parallel leucine zippers. Antiparallel attachment of leucine zippers 
to protein fragments (between a dissected peptide bond of the parent protein) requires a 
shorter amino acid linker region. As shown by the inventors of the present invention, as a 
preferred embodiment, a linker having 4-6 amino acids is sufficient (see Examples). 
Similar attachment of parallel leucine zippers would require > 10 amino acids to span the 
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necessary distance. The long unstructured linkers would be prone to proteolytic cleavage 
and be less stable in in vivo assays. 

[0011] Katz et al. (1998, Biotechniques, 25, 298) describe a targeting approach based on 
noncovalent heterodimerization of GFP and cytoplasmic structural proteins using a 
leucine zipper designed to form high-affinity heterodimers. The complexes localized 
accurately to specific sites within cells, providing selective fluorescence labeling of 
subcellular structures such as microfilaments or focal contacts. 

Protpin-Proteir Tt^prtion Assays 
[0012] The association and dissociation of protejrffis crucial to all aspects of cell 
I function. Examples of protein-protein interns are evident in hormones and their 
respective receptors, in intracellular and^acellular signalling events mediated by 
proteins, in enzyme substrate interac^ns, in intracellular protein trafficking, in the 
formation of complex structure^ ribosomes, viral coat proteins, and filaments, and in 
antigen-antibody interactions/ntracellular assays for detection of protein interactions 
and identification of their ^hibitors have received wide attention with the completion of 
the human genome sequence. 

[0013] US Patent 5,585,245 discloses a first fusion protein comprising an N-terminal 
subdomain of ubiquitin, fused to a non-ubiquitin protein or peptide and a second fusion 
protein comprising a C-terminal subdomain of ubiquitin, fused to the N-terminus of a 
non-ubiquitin protein or peptide. The patent discloses the use of these fusion proteins for 
studying protein-protein interactions. When contacted with one another, provided that the 
non-ubiquitin proteins or peptides interact (bind) with one another, the N- and C-terminal 
ubiquitin subdomains associate to reconstitute a quasi-native ubiquitin moiety which is 
recognized and cleaved by ubiquitin-specific proteases. However, this assay requires the 
use of additional cellular factors, such as the ubiquitin-specific proteases, for detection of 
protein-protein interaction. Thus, this assay is not feasible for high throughput screening 
of cDNA libraries. 
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[0014] US Patent 5,362,625 discloses omega-acceptor and omega-donor polypeptides 
(comprising about two-thirds and one-third of the p.galactosidase molecule amino and 
carboxyl termini, respectively), prepared by recombinant DNA techniques, DNA 
synthesis, or chemical polypeptide synthesis techniques, which are capable of interacting 
to form an active enzyme complex having catalytic activity characteristic of p- 
galactosidase. The patent also describes the use of these polypeptides in enzyme 
complementation assays for qualitative and quantitative determination of a suspected 
analyte in a sample. 

[0015] The yeast two-hybrid system for detecting protein-protein interactions in 
Saccharomyces cerevisiae (Fields and Song, 1989, Nature, 340:245-246; U.S. Pat. No. 
5,283,173 by Fields and Song) is well known in the art. This assay utilizes the 
reconstitution of a transcriptional activator like GAL4 (Johnston, 1987, Microbiol. Rev., 
51:458-476) through the interaction of two protein domains that have been fused to the 
two functional units of the transcriptional activator: the DNA-binding domain and the 
activation domain. This is possible due to the bipartite nature of certain transcription 
factors like GAL4. Being characterized as bipartite signifies that the DNA-binding and 
activation functions reside in separate domains and can function in trans (Keegan et al, 
1986, Science 231:699-704). The reconstitution of the transcriptional activator is 
monitored by the activation of a reporter gene like the lacZ gene that is under the 
influence of a promoter that contains a binding site (Upstream Activating Sequence or 
UAS) for the DNA-binding domain of the transcriptional activator. This method is most 
commonly used either to detect an interaction between two known proteins (Fields and 
Song, 1 989, Nature, 340:245-246) or to identify interacting proteins from a population 
that would bind to a known protein (Durfee et al., 1993, Genes Dev., 7:555-569; Gyuris ef 
al, 1993, Cell, 75:791-803; Harper et al, 1993, Cell, 75:805-816; Vojtek et al., 1993, 
Cell, 74:205-214). Like the ubiquitin system, additional factors are required for detection 
of the protein-protein interaction. Additionally, in the yeast two-hybrid system, the 
protein interaction must occur in the nucleus of the yeast. 
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[0016] WO 98/34120 describes protein fragment complementation assays for detecting 
bimolecular interactions. The assays comprise coexpression of fusion peptides consisting 
of N- and C-terminal fragments of murine DHFR fused to GCN4 leucine zipper 
sequences in E. coli to form colony. Colony formation only occurs when both DHFR 
5 fragments are present and contain leucine-zipper forming sequences. The published 
patent application contemplates the use of the assay to study molecular interactions 
including protein-protein, protein-DNA, protein-RNA, protein-carbohydrate, and protein- 
small molecule interactions, and for screening cDNA libraries for binding of a target 
protein with unknown proteins or libraries of small organic molecules for biological 
io activity. WO 98/34120 also contemplates the use of GFP in the protein fragment 
^ complementation assay. However, the published patent application does not suggest 

1 5 fusing antiparallel leucine zipper to DHFR or GFP for reconstitution. GCN4 disclosed in 

" the published application and routinely used by skilled artisan to reassemble proteins 

O especially in the yeast two hybrid system, is a parallel zipper. Antiparallel and parallel 

!^ 15 zippers orient proteins in opposite direction; thus, it is not predictable that an antiparallel 
j£| zipper can be substituted for a parallel zipper. 

1^ [0017] Additionally, all protein reassembly strategies disclosed in WO 98/34120 are for 

reassembly of multi domain proteins such as DHFR. The two dissected domains of 
DHFR can fold separately and only need to be brought into close proximity by attached 
20 proteins. There is no precedent for rational dissection of a single domain protein such as 
GFP that can be accomplished based upon the WO 98/34120. WO 98/34120 does not 
teach how to rationally dissect single domain proteins that can be subsequently 
reassembled. Finally, the ability to identify and characterize appropriate sites for 
dissecting a single domain protein is not validated or demonstrated in WO 98/34120. 
25 [001 8] US Patent 6, 1 80,343 relates to the use of fluorescent proteins, particularly green 
fluorescent protein (GFP), in fusion constructs with random and defined peptides and 
peptide libraries, to increase the cellular expression levels, decrease the cellular 
catabolism, increase the conformational stability relative to linear peptides, and to 
increase the steady state concentrations of the random peptides and random peptide 
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library members expressed in cells for the purpose of detecting the presence of the 
peptides and screening random peptide libraries. The patent does not contemplate the use 
of antiparallel leucine zipper for reconstituting GFP nor the use of peptides that associate 
with each other to reconstitute GFP and to provide a detection signal. 

5 

SUMMARY OF THE INVENTION 

[0019] The present invention provides protein complexes comprising a first and second 
3 peptide, each of said peptides being joined, operably linked, or fused to a heterologous 

S helical domain, said helical domains being noncovalently associated to form an 

fi 10 antiparallel leucine zipper. The peptides of the protein complexes form a functional 
« signaling moiety such as a reporter, a marker, or a biosensor upon non-covalent 

H association of the helical domains into an antiparallel leucine zipper. In one embodiment, 

each of the peptides is joined to a helical domain via a linker. In a preferred embodiment, 
S each of the helical domains comprises an amino acid sequence as set forth in SEQ ID NO: 

t 15 1 or SEQ ID NO: 2. Preferably, each of the first and second peptides comprises a distinct 
3 portion of green fluorescent protein (GFP). 

[0020] In one aspect, the present invention provides fusion proteins comprising a 
peptide and a helical domain, said helical domain forming an antiparallel leucine zipper 
when it noncovalently associates with a complementary helical domain. The helical 
20 domain is a heterologous or distinct protein or polypeptide fragment, relative to the 

peptide of the fusion protein. The fusion protein may further comprise a linker moiety 
interposed between the peptide and the helical domain. In a preferred embodiment, the 
peptide comprises a peptide derived from green fluorescent protein (GFP). 
[0021] In another aspect, the present invention provides nucleic acids encoding fusion 
25 proteins comprising a peptide and helical domain, said helical domain forming an 

antiparallel leucine zipper when it noncovalently associates with a complementary helical 
domain. 

[0022] The present invention provides a method of assembling a protein complex 
comprising (a) providing first and second helical domains that non-covalently associate 
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to form an antiparallel leucine zipper; (b) providing first and second peptides; (c) 
producing fusion proteins by separately fusing said first helical domain to said first 
peptide and said second helical domain to said second peptide; and, (d) allowing the 
fusion proteins to form a protein complex mediated by the non-covalent association of the 
5 first and second helical domains into an antiparallel leucine zipper. The first and second 
peptides are distinct peptides. Preferably, they are distinct peptides derived from GFP, 
such that they comprise different GFP fragments. 

[0023] In one embodiment of the disclosed method of assembling a protein complex, 
the protein complex comprises a signaling moiety and the helical domains comprise a 

10 leucine rich hydrophobic core. The helical domains may further comprise acidic residues 
and basic residues. The helical domains may further comprise a buried asparagine 
residue. The pair of helical domains preferably have the amino acid sequences as set 
forth in SEQ ID NO: 1 and SEQ ID NO: 2. In an alternative embodiment of the method, 
the step of producing the fusion proteins further comprises interposing a linker moiety 

15 between the peptide and the helical domain. 

[0024] The present invention also provides a method of identifying a polypeptide that 
interacts with a known polypeptide comprising (a) producing a first fusion protein 
comprising the known polypeptide linked to a first GFP fragment; (b) producing a second 
fusion protein comprising a test polypeptide linked to a second GFP fragment, wherein 

20 association of the first and second GFP fragments results in a GFP that exhibits detectable 
fluorescence; (c) allowing the first fusion protein to associate with the second fusion 
protein to form a complex mediated by the non-covalent association of the known 
polypeptide and test polypeptide; and, (d) detecting whether, or to what extent, 
association of first and second GFP fragments occcurs, wherein association of GFP 

25 indicates that the test polypeptide interacts with the known polypeptide. Preferably, the 
first GFP peptide is NGFP and the second GFP peptide is CGFP. 
[0025] In one aspect, the present invention provides a method of identifying a 
polypeptide that interacts with a known polypeptide comprising (a) producing a nucleic 
acid encoding a fusion protein comprising the known polypeptide linked to a first GFP 
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fragment; (b) producing a plurality of nucleic acids encoding fusion proteins comprising a 
test polypeptide linked to a second GFP fragment, wherein association of the first and 
second GFP fragments results in a GFP that exhibits detectable fluorescence; (c) 
cotransforming or cotransfecting the nucleic acids of steps (a) and (b) into a host cell for 
5 expression of the encoded fusion proteins; (d) selecting colonies that exhibit fluorescence; 
and, (e) culturing the selected colonies to identify the test polypeptides that interact with 
the known polypeptide. 
^ [0026] In a preferred embodiment of the constructs and methods of the present 

m invention, the first GFP peptide is NGFP and the second GFP peptide is CGFP. Also, 

gjj 10 preferably, the nucleic acids of step (b) of the foregoing identification step are produced 
^ in the form of a combinatorial library. 

\Q [0027] In another aspect, the present invention provides a method of identifying a 

s ~ molecule that inhibits the activity of a known protein comprising (a) producing a first 

1 3 fusion protein comprising a first known polypeptide linked to a first GFP fragment; (b) 

\*& 15 producing a second fusion protein comprising a second polypeptide linked to a second 
;£ GFP fragment, wherein the second polypeptide is known to interact with the first 

H polypeptide and wherein association of the first and second GFP fragments results in a 

GFP that exhibits detectable fluorescence; (c) allowing the first fusion protein to associate 
with the second fusion protein to form a GFP complex mediated by the non-covalent 
20 association of the first and second polypeptide; (d) incubating a test molecule with the 

GFP complex; and, (e) detecting disassembly of the complex, wherein disassembly of the 
complex indicates that the test molecule inhibits the activity of the known protein. 
Preferably, the first GFP peptide is NGFP and the second GFP peptide is CGFP. 
[0028] The present invention also contemplates a method of detecting protein-protein 
25 interactions comprising (a) producing a first fusion protein comprising a known 
polypeptide linked to a first GFP fragment; (b) producing a second fusion protein 
comprising a test polypeptide linked to a second GFP fragment, wherein association of 
the first and second GFP fragments results in a GFP that exhibits detectable fluorescence; 
(c) allowing the first fusion protein to associate with the second fusion protein to form a 
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complex mediated by the non-covalent association of the known polypeptide and test 
polypeptide; and, (d) detecting reassembly of GFP, wherein reassembly of GFP indicates 
that the test polypeptide interacts with the known polypeptide. 

[0029] A related method may further comprise obtaining nucleic acids encoding the first 
5 and second fusion proteins and cotransfecting or cotransforming the nucleic acids into a 
cell to obtain the first and second fusion protein. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0030] Figure 1 shows the strategy for antiparallel leucine zipper directed protein 
10 reassembly of GFP (Kraulis, P. J. 5 1991, J. AppL Crystallog., 24, 946-950). Both the 
ribbon and topographical structures are depicted: GFP is shown in green, NZGFP is 
shown in blue and CZGFP is shown in red. The sequences of the designed leucine 
zippers, NZ and CZ, are ALKKELQANKKELAQLKWELQALKKELAQ (SEQ ID NO: 
1) and EQLEKKLQALEKKLAQLEWKNQALEKKLAQ (SEQ ID NO: 2) respectively. 

15 

[0031] Figure 2 shows fluorescence binding isotherm for the interaction of NZGFP 
with CZGFP monitored at 505 nm. Inset shows the normalized fluorescence excitation 
and emission of the reconstituted NZGFP.CZGFP complex. 

20 [0032] Figure 3 shows in vitro recbnstitution of GFP demonstrated by (a) green 

fluorescent BL21(DE3) cells and the corresponding SDS gels of (b) lanel: MW markers; 
lane 2: protein from cotransformed green colony; and lane 3: protein from colony 
containing only NZGFP plasmid and (c) lanel : MW markers; protein from cotransformed 
green colony; and lane 3: protein from colony containing only CZGFP plasmid. 



25 




[0033] Figure 4 shows the antiparallel leucine zipper pairs attached to CGFP and NGFP 
are shown in helical wheel representations/ The Lys (K) residues are colored blue and 
complementary Glu (E) residues are colored red. The pairs a and b are electrostatically 
matched and the pairs c and d are electrostatically mismatched. The inset shows restreaks 
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f single Escherichia c^/fcolonies corresponding to each pair. EK-CGFP is the same as 
ZGFP, and EK-N0FP is the same as NZGFP. 

[0034] Figures 5 A-C show fluorescence based selection. A. The "prey" leucine zipper 
attached to CGFP is randomized (X) at the e and g positions of the helix with either Lys 
(K) or Glu (E). and the "bait" leucine zipper attached to NGFP contains only Glu (E) 
residues at both e and g positions. B. SDS gel showing protein expression profiles of 10 
cotransformed green fluorescent colonies from the selection. C. Tabulation of the 
residues selected in the library leucine zipper (XX-CGFP) by screening for fluorescence 
of cotransformed Escherichia coli cells. 

[0035] Figure 6 shows fluorescence spectra for the dominant negative inhibition of 
NGFP/CGFP complex formation as a function of added EK leucine zipper inhibitor (SEQ 
ID NO: 1). Inset shows the inhibition profiles of the EK leucine zipper, NGFP, and 
CGFP peptides. 

DETAILED DESCRIPTION 

1. General Description 
[0036] The present invention is based on the fmdmg that the dissection and subsequent 
reassembly of a protein from peptidic fragments rfrovides an avenue for controlling the 
protein's tertiary structure and hence its function. 

[0037] The present invention is based in part on the surprising discovery of a general 
method for the reassembly of protein fragments mediated by the non-covalent association 
of antiparallel leucine zippers (Lupas, A., 1996, Trends Biochem. Sc. 21, 375-382; Kohn, 
W. D. et al. 9 1997, J. Biol. Chem. 272, 2583-2586; Bryson, J. W. et al. f 1995, Science, 
270, 935-941). Specifically, the present invention discloses a strategy for the noncovalent 
reconnection of the N- and C- termini of a dissected surface loop of a protein by means of 
antiparallel leucine zippers (Figure 1) (Kraulis, P. J., 1991,7. Appl Crystallog., 24, 946- 
950). The present invention demonstrates the successful application of this 
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oligomerization strategy, both in vitro and in vivo, to the 238 residue green fluorescent 
protein (GFP) from Aeguorea victoria (Tsien, R. Y., 1998, Annu. Rev. Biochem., 67, 
509-544). GFP provides an easily testable system for correct reassembly by virtue of its 
autocatalytically generated fluorescence, which is intimately linked to its properly folded 
5 structure (Ormo, M. et al., 1996, Science, 273, 1392-1395; Reid, B. G. et al, 1997, 
Biochemistry, 36, 6786-6791; Miyawaki, A. etal, 1997, Nature, 388, 882-887; 
Miesenbock, G. et al, 1998, Nature, 394, 192-195;). 

[0038] The present invention is also based in part on the discovery of an effective 
strategy involving linking fragments of an enzyme to potentially interacting protein- 
ic) partners such that functional enzyme reassembly only occurs on formation of a strong 

protein-protein complex. In one aspect, the present invention establishes the selectivity of 
the GFP reassembly mediated selection of interacting proteins (GRIP) assay and applies it 
to the in vivo colorimetric selection of complementary leucine zipper pairs from 
combinatorial libraries in Escherichia coli. In another aspect, the present invention 
15 demonstrates the applicability of the GRIP assay to monitor the disruption of protein- 
protein interactions by a dominant negative approach. Accordingly, the present invention 
provides an assay system that has the potential to monitor protein-protein interactions in 
their natural environment within a cell and are not limited to the nucleus as are classic 
yeast two-hybrid systems (Fields, S. et al, 1989, Nature, 245-246). 

20 

2. Definitions 

[0039] As used herein, "active protein complex" refers to a protein complex comprising 
two or more peptides and retaining substantially all the functional activity of the native 
protein from which the peptides are obtained. 
25 [0040] As used herein, "covalent bond" refers to an interatomic bond characterized by 
sharing of electrons. 

[0041] As used herein, "fusion protein" or "chimeric protein" refers to a hybrid protein, 
which consists of two or more proteins, or fragments thereof, linked together covalently. 
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A fusion protein may comprise two or more peptides or proteins from different animals, 
origins, or species. 

[0042] As used herein, "helical domain" refers to a protein or polypeptide fragment or a 
peptide having a a helix or a coiled configuration. 

[0043] As used herein, "heterologous protein or peptide" refers to a protein or peptide 
derived from a different origin, animal, or species. Heterologous proteins or peptides are 
not operably linked in their naturally occurring or native form. 

[0044] As used herein, "noncovalent association" refers to molecular interactions that 
do not involve an interatomic bond. Noncovalent interactions involve, for example, ionic 
bonds, hydrogen bonds, hydrophobic interactions, and van der Waals forces. 
Noncovalent forces may be used to hold separate polypeptide chains together in proteins 
or in protein complexes. 

[0045] As used herein, "protein complex" refers to a combination of two or more 
proteins into a larger molecule without covalent bonding. 

[0046] As used herein, "random peptide" refers to an oligomer composed of two or 
more amino acid residues and constructed by a means with which one does not preselect 
the complete sequence of a particular oligomer. 

047] As used herein, "randonipeptide library" or a "combinatorial library" refers a 
library comprising not only^f a set of recombinant DNA vectors (also called 
recombinants) that encpaes a set of random peptides, but also of random peptides 
encoded by those Wrtors, as well as the fusion proteins containing those random 
peptides. / 1 

[0048] As used herein, "signaling moiety" refers to a moiety that acts to cause an action 
such as a signal. The moiety may signal as a result of an enzymatic reaction, light 
absorption, or other means. 
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3. Specific Embodiments 

A. GFP as a System for Protein Reassembly and Fragment Complementation 

Assay 

0049] The present invention is based in part on the u^e of GFP as model for protein 
eassembly and fragment complementation based assays. GFP provides an ideal system 
for these assays because the reassembled protepi autofluoresces and is easily visualized 
and amenable to fluorescence activated cell sorting (Tsien, R. Y., 1998, Annu. Rev, 
Biochem. 67, 509-544; Misteli, T. et ah,\991, Nat. BiotechnoL 15, 961-964). GFP 
fluorescence does not require the addition of other cellular factors, substrates, or 
additional gene products from A vidoria. Moreover, GFP can be expressed and detected 
in various cells and organisms ancris not localized to a specific organelle of a cell upon 
expression. Additionally, unlike the DHFR assay, detection of GFP expression is not 
dependent upon survival or ddath of host cells. Nor is the expression of GFP dependent 
upon the addition of cofact^rs as in the P-galactoside assay or of other cellular 
components as in the ubiquitin assay. It is also not toxic to mammals and has been 
expressed in monkeys/chan et ah, 2001, Science, 291, 309). Further, the multiple 
variants of GFP available for use in different organisms and cell-types makes it an ideal 
protein candidate/for development of a general assay such as the GRIP assay described 
below. ' 

[0050] Various mutations in GFP leading to brighter emission following 488 nm 
excitation have been generated. Mutations in GFP which shift the excitation maximum 
from 395 nm to about 490 nm have been reported by Delagrave et ah (1995, 
Biotechnology, 13, 151) and Heim et ah (1995, Nature, 373, 663). Mutants with Ala, 
Gly, He, Cys or Thr substituted for Ser65 have large shifts in excitation maxima, and 
fluoresce more intensely than wild- type protein when excited at 488 nm. The mutation of 
Ser65 to Thr or Cys has been observed to increase by a factor of 6 the fluorescence of 
GFP following 488 nm excitation. Heim et ah (1994, Proc. Nat'lAcad. Sci USA, 91, 
12501-12504) describe a mutant that fluoresces blue arid contains a histidine in place of 
Tyr66. Delagrave et ah (1995, Bio. Technology, 13, 151-154) report on several Aequorea 
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GFP variants that showed red- shifted excitation spectra, i.e., shift in excitation maxima 
from 393 nm to 498 nm. Delagrave et al. hypothesize that co-expression of GFP and red- 
shifted GFP (RSGFP) will enable the analysis of two proteins or promoters per cell or 
organism. 

5 [0051] U.S. Patent 6,096,865 describes GFP mutants with improved solubility 

properties at higher temperatures and are able to fluoresce at 37 °C. Specifically, the 
patent provides a GFP mutant in which phenylalanine at original amino acid position 64 
is replaced by a leucine. This mutant has the ability to fluoresce at 37 °C. Other mutants 
with altered spectra are disclosed by Heim et al. (1994, Proc. Nat'l Acad. Sci USA, 91, 
10 12501-12504 and 1995, Nature, 373, 663). 

[0052] The present invention contemplates the use of various GFP mutants in the 
protein complementation assay and protein reassembly assay described in detail below. 
The preferred GFP mutant is the sglOO GFP variant described below. 



15 B. Methods for Reassembly of Fragments into a Functional Protein 

[0053] The present invention also is based in part on the discovery that an antiparallel 
leucine zipper is useful for in vitro reassembly of protein fragments into a functionally 
active protein. Specifically, a GFP variant (sglOO) which has a single excitation and 
emission maximum at 475 nm and 505 nm respectively, was dissected and refolded using 
20 an antiparallel leucine zipper. The GFP variant, sglOO, was dissected at a surface loop 
between residues 157 and 158. A pair of helices, NZ and CZ (SEQ ID NO: 1 and 2), 
capable of forming an antiparallel leucine zipper was designed and fused to the dissected 
GFP fragments via linkers to form NZGFP (N-terminal GFP) and CZGFP (C-terminal). 
Under conditions routinely used for folding denatured GFP, NZGFP and CZGFP 
25 reassembled properly to form a functionally active GFP. The wavelengths, A, m ax, for 
fluorescence excitation and emission spectra were identical to that of the parent GFP 
(Figure 2). 

[0054] The present invention is also bas^d in part on the discovery that an antiparallel 
(leucine zipper is useful for in vivo reassembly of protein fragments into a functionally 
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active protein. Specifically, equim^i&r amount of plasmids encoding NZGFP and CZGFP 
were transformed into E. coli ceHs. Colonies that turned green (Figure 3, panel a) were 
selected and further cultured^n liquid media for analysis of the protein expression pattern. 
As shown in figure 3, panels b and c 5 the green colonies expressed similar amounts of 
NZGFP and CZGFP, Vhereas the non- fluorescent colonies contained either NZGFP or 
CZGFP. Moreovecf ;ontrol cotransformation experiments with NGFP/CGFP, 
NGFP/CAFP, anfl NZFP/CGFP did not have any green colonies. Accordingly, the 
presence of berth NZ and CZ leucine zippers are required to mediate GFP assembly in 
vivo and inmtro. 

[0055] The present invention contemplates the use of the antiparallel leucine zipper to 
refold, reconstitute, or reassemble proteins from peptides. Moreover, the ability to 
reconstitute GFP from its peptide fragments can be extended to an in vivo fragment 
complementation assay for the selection of antiparallel leucine zippers as has been 
demonstrated for parallel leucine zippers with DHFR (Pelletier, J. N. et al, 1999, Nat. 
Biotechnol, 17, 683-690). As described below, fragmented GFP can be used to study the 
in vivo interaction of protein-protein pairs which have their N and C termini in close 
proximity (Pelletier, J. N. et al, 1998, Proc. Natl Acad: Sci. U.S.A., 95, 12141-12146). 
More generally, the protein reassembly strategy of the present invention may have 
applications such as the selective isotopic labeling of one fragment of a large protein for 
NMR analysis, or the mutagenesis of a limited region of a protein as demonstrated for 
inteins (Cotton, G. J. etal, 1999,./. Am. Chem. Soc, 121, 1100-1101; Cotton, G. J. etal, 
1999, Chem. Biol., 6, R247-R256; Muir, T. W. et al, 1998, Proc. Natl Acad. Sci. U.S.A., 
95, 6705-6710; Xu, R. etal, 1999, Proa Natl. Acad. Sci. U.S.A., 96, 388-393). Further, 
the engineering of an on/off switch for the activity of fragmented proteins by designing a 
leucine zipper heterodimer which can be reversibly assembled or disassembled by 
controlling the environmental conditions is also contemplated (Zutshi R. et al, 1998, 
Curr. Opin. Chem. Biol, 2, 62-66; Yao, S. et al, 1998, Nature, 396, 447-450; Krylov, D. 
etal, \99A,EMBOJ., 13,2849-2861). 
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C. GRIP Assay and Combinatorial Selection 
[0056] The present invention is also based on the selectivity of the GFP reassembly 
mediated selection of interaction proteins. Based on this selectivity, the present invention 
developed the GRIP assay (GFP reassembly mediated selection of interacting proteins or 
peptides) and applied the assay to the in vivo colorimetric selection of complementary 
leucine zipper pairs from combinatorial libraries. 

)057] Specifically, the inventors having established that the GRIP assay was selective 
fonhigh affinity LZ (leucine zipper) pairsytested the applicability of the assay in the 
combinatorial selection of LZ pairs that/would interact strongly enough to promote GFP 
Reassembly (Figure 5 A). This would/extend the GRIP system for selection of protein 
partners as had been demonstrated/lor other fragment reassembly systems (Pelletier, J.N. 
et aL, 1999, Nat BiotechnoL 1^683-690). A simple experiment in which the acidic LZ 
containing N-terminal GFP fragment (EE-NGFP) was kept constant was chosen. A 
library of LZ partners thatyould either code for Glu or Lys with equal probability at the e 
and g "specificity" positions (Figure 5A) was generated. This library was fused to the C- 
terminal GFP fragment y(XX-CGFP). The plasmid encoded library of XX-CGFP and EE- 
NGFP were cotransfonned into host cells, and colonies that exhibited fluorescence were 
selected and analyzed by sequencing. As expected, there was an overall enrichment of 
Lys residues as tha'selected parner for complementing the acidic EE-NGFP. The 
electrostatic pairing of Lys/Glu is required for stabilizing the leucine zipper. 

[0058] The present invention demonstrates that the GRIP assay is selective for specific 

/ 

protein pairs in vivo and is amenable for the selection of complementary protein pairs in 



vivo. 



l 



D. GRIP Assay and its Use in Detection of Inhibitors or Protein-Protein 
Interactions 

[0059] The present invention is further based in part on the discovery that the GRIP 
assay is useful for assaying the disruption of protein-protein interactions in vitro. The 
GRIP can be utilized for identifying inhibitors of protein-protein interactions. 
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Specifically, a LZ peptide (SEQ ID NO: 1) was incubated with NGFP/CGFP complex. 
The sample was monitored for fluorescence as a function of added peptide (Figure 6). 
The LZ peptide (SEQ ID NO: 1) prevented the assembly of the complex (4 \xM) with an 
IC 5 o value of 3 1 p,M. Control experiments with addition of either NGFP or CGFP 
5 fragments that lacked leucine zippers did not prevent reassembly of NZGFP/CZGFP 
complex (Figure 6). 

E. Applications of th e GRIP Assay 
k Q [0060] The present invention is based on the development of a visually detectable 

]Z 10 colorimetric system for studying the assembly and disassembly of protein partners. This 

I j 1 

i xJ system can be used for high-throughput screening, for example, screening using 

,S fluorescence activated cell sorting in yeast (Winson, M. K. et aL, 2000, Methods, 21, 231- 

^ 240 (2000).). Further, the system can be practiced using protein three-hybrid detection 

□ system, with two interacting proteins fused to respective fragments of a donor GFP 

^7 15 variant and a third protein fused to an acceptor GFP variant, thus allowing for in vivo 
■h fluorescence resonance energy transfer measurements (Tsien, R. Y., 1998, Annu. Rev. 

\1 Biochem., 67, 509-544; Pollok, B. A. et aL, 1999, Trends Cell Biol., 9, 57-60). 

[0061] In the GRIP assay, the emitted light can be analyzed by visual screening, a flow 
sorter (FACS), a spectrophotometer, a microtiter plate reader, a charge coupled devise 
20 (CCD) array, a fluorescence microscope, or other similar devices. 

[0062] The GRIP assay may be performed in using a multiwell format. Typically, wells 
are arranged in two dimensional linear arrays with greater than 864 wells on a standard 
microtiter plate footprint. Other commonly used numbers of wells include 1536, 3456, 
and 9600. Well volumes typically vary from 500 nanoliters to over 200 microliters, 
25 depending on well depth and cross sectional area. Well volumes of 1, 2, 5, 10, 20, and 50 
microliters are commonly used. Wells can be made in any cross sectional shape (in plan 
view) including, square, round, and hexagonal and combinations thereof. Wells can be 
made in any cross sectional shape (in vertical view), including shear vertical walls with 
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flat or round bottoms, conical walls with flat or round bottoms and curved vertical walls 
with flat or round bottoms and combinations thereof. 

[0063] US Patent 6,229,603 provides multi-well plates with greater than 864 wells that 
comprise a layer of cycloolefin having low fluorescence and high transmittance. These 
multi-well plates are particularly well suited for fluorescence measurements. 
[0064] The GRIP assay may be used to study protein-small molecule interactions. 
Alternatively, the assay may be used to investigate protein-protein interactions and to 
screen libraries for identification of binding molecules. Examples of protein-protein 
interactions include, but are not limited to, antigen/antibody,.ligand/receptor, antagonist 
or inhibitor/protein, binding protein/protein, and enzyme/substrate. 
[0065] Further, the GRIP assay may be used to investigate other macromolecular 
interactions. A known DNA or RNA binding protein, "A" (that binds a RNA or DNA 
sequence "X"), is fused to one fragment of GFP, for example NGFP, and a second 
putative RNA or DNA binding protein from library "Z" is fused to, for example CGFP. 
In an in vivo or in vitro system the DNA or RNA component ("X-Y") that is being 
assayed for will have the DNA or RNA sequence "X" attached to a second DNA or RNA 
sequence Y whose protein target is being sought from library "Z". When binding or 
complexing occurs between "X" and A-NGFP and "Y" with a protein from the X-CGFP 
library, fluorescence will be established. 

[0066] Variations upon this can be" used to identify carbohydrate-protein partners or 
small molecule protein partners by making appropriate changes in the NGFP fused 
protein A (which can be chosen to bind carbohydrate or small molecule components). 
[0067] This assay may also be used to investigata^ibraries of DNA, RNA, 
carbohydrates, peptides or other small molecule^: In this situation "X-Y" can be a 
library. "X" is held constant with a known DNA, RNA, carbohydrate, or small molecule 
that binds a protein, "A", and "Y" can be varied as desired. The fusion proteins A-NGFP 
and Z-CGFP can also be held constant, /y" is to identified and is a molecule that binds 
Z-CGFP. Establishing fluorescence will indicate identification of a DNA, RNA, 
carbohydrates, or small molecules component Y that binds protein Z. 
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F. Combinatorial Libraries 
[0068] A combinatorial chemical library is a collection of diverse chemical compounds 
generated by either chemical synthesis or biological synthesis, by combining a number of 
chemical "building blocks" such as reagents. For example, a linear combinatorial 
chemical library such as a polypeptide library is formed by combining a set of chemical 
building blocks (amino acids) in every possible way for a given compound length (i.e., 
the number of amino acids in a polypeptide compound). Millions of chemical compounds 
can be synthesized through such combinatorial mixing of chemical building blocks. 
(J \ [0069] Preparation and screening of combinatorial chemical Htfraries is well known to 
10 persons of skill in the art. Such combinatorial chemical libraries include, but are not 
[ / limited to, peptide libraries (see, e.g., U.S. Pat. No. 5,01(U75, Furka, 1991, Int. J. Pept. 
/ Prot. Res. 37, 487) and Houghton et al, 1991, Nature 84). Other chemistries for 
generating chemical diversity libraries can also be us^d. Such chemistries include, but 
are not limited to: peptoids (PCT Publication No.^VO 91/19735), encoded peptides (PCT 
Publication WO 93/20242), random bio-oligom^rs (PCT Publication No. WO 92/00091), 
benzodiazepines (U.S. Pat. No. 5,288,514), djVersomers such as hydantoins, 
benzodiazepines and dipeptides (Hobbs et dl, 1993, Proc. Nat. Acad. Sci. USA 90, 6909), 
vinylogous polypeptides (Hagihara et a/71992, J. Amer. Chem. Soc. 1 14, 6568), 
nonpeptidal peptidomimetics with P-lVglucose scaffolding (Hirschmann et al., 1992, J. 
Amer. Chem. Soc. 114, 9217), analogous organic syntheses of small compound libraries 
(Chen et al, 1994, J. Amer. ChemJsoc. 116, 2661), oligocarbamates (Cho et al., 1993, 
Science 261, 1303), and/or peptidyl phosphonates (Campbell et al, 1994,7. Org. Chem. 
59, 658), nucleic acid librariesTpeptide nucleic acid libraries (U.S. Pat. No. 5,539,083), 
antibody libraries (Vaughn effal, 1996, Nature Biotechnology 14(3), 309 and 
PCT/US96/10287), carbohydrate libraries (Liang et al, 1996^ Science 274, 1520 and U.S. 
Pat. No. 5,593,853), small organic molecule libraries (benzodiazepines, Baum, C&EN 
January 18, page 33 (19^3); isoprenoids, U.S. Pat. No. 5,569,588; thiazolidinones and 
metathiazanones, U.S.yPat. No. 5,549,974; pyrrolidines, U.S. Pat. Nos. 5,525,735 and 
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5,519,134; morpholino compounds, U.S. Pat. No. 5,506,337; benzodiazepines, U.S. Pat. 
No. 5,288,514, and the like). 

[0070] Devices for the preparation of combinatorial libraries are commercially available 
(see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville Ky., Symphony, Rainin, 
Woburn, Mass., 433A Applied Biosystems, Foster City, Calif., 9050 Plus Millipore, 
Bedford, Mass.). In addition, numerous combinatorial libraries are themselves 
commercially available (see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, Ru, 
Tripos, Inc., St. Louis, Mo., ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, 
Pa., Martek Biosciences, Columbia, Md., etc.). 

[0071] The small molecules of a smalj^nolecule combinatorial library may be selected 
from at least one of the group consisting of amino acids, peptides, oligonucleotides, and 
heterocyclic compounds. The nrfesent invention contemplates contemplates combinatorial 
libraries of small molecules^mat are naturally occurring or synthetic. 
[0072] Suitable peptides comprise as few as two amino acids to as many as about 30; 
preferably, suitable peptides comprise from about two amino acids to about fifteen; most 
preferably, suitable peptides comprise from about two amino acids to about ten. Any 
amino acid may be incorporated into peptides screened and identified using the present 
invention, including any combination of the naturally occurring proteinogenic amino 
acids as well as amino acids not naturally occurring in proteins such as, but not limited to, 
dextrorotatory forms of the known amino acids, for example. 

[0073] Suitable oligonucleotides consist of as few as two nucleotides to as many as 
about 50; preferably, suitable oligonucleotides consist of from about five nucleotides to 
about 30; most preferably, suitable oligonucleotides consist of from about five 
oligonucleotides to about 15. Any nucleotide may be incorporated into an 
oligonucleotide to be screened and identified using the present invention, including any 
combination of the naturally occurring deoxyribonucleotides and ribonucleotides as well 
as those not naturally occurring in biological systems, such as, but not limited to, H- 
phosphonate derivatives^N-blocked-S^O-DMT-deoxynucleoside 3 ? -(2-cyanoethyl-N,N- 
diisopropyl)phosphoramidites, N-blocked-S'-O-EiMT-deoxynucleoside 3'-(2-cyanoethyl- 
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N,N-diisopropyl)phosphoramidites, N-blocked-5'-0-DMT-deoxynucleoside 3'-(methyl- 
N,N-diisopropyl) phosphoramidites, N-blocked-5-O-DMT-deoxynucleoside 3 , -(2- 
chlorophenyl) phosphates, N-blocked-S'-O-DMT-deoxynucleoside 3 f -(2-chlorophenyl 2- 
cyanoethyl) phosphate, all of which are nucleoside derivatives used in oligonucleotide 
synthesis. 

10074] Suitable heterocyclic compounds consul of, at minimum, a single four 
/membered ring to as much as a multiple of ftfur membered or greater membered rings 
coupled by carbon chains of 1 to about 20 atoms in length, such chains being saturated or 
not. Preferably, suitable heterocyclic^ompounds include a single four- to seven- 
membered ring, as well as, but not/fimited to varying combinations of 5, 6, or 7 
membered rings having varying/humbers of N, S, or O atoms. Examples of suitable 
heterocyclic compounds inclufle benzodiazepine and derivatives thereof (as, for example, 
disclosed in Bunin etaL, 1992, J. Am. Chem. Soc. 114, 10997),_penicillins, 
cephalosporins, and folate derivatives. 

[0075] For ease of identification, the molecules in a small molecule combinatorial 
library may be tagged for decoding their identity. 

IQ076J The GRIP assay may be used to screen mixed libraries. Mixed libraries of small 
molecules comprising amino acids, peptides, oligonucleotides, and heterocyclic 
compounds that are 5'-hydroxyl derivatives of the oligonucleotides may be used. The 
peptide end of members of a peptide library can be modified to include a carboxyl group. 
A process of esterification of the carboxyl group with the 5 f -hydroxyl of the 
oligonucleotide is used to produce a mixed library containing peptide-oligonucleotide 
species. Brenner et aL, (1992, Proc. Nat'l Acad. ScL USA 89, 5381) also describes a 
method of preparation of mixed libraries having nucleotides and peptides. A mixed 
library comprising a heterocyclic compound and a peptide is also prepared by the reaction 
of suitable functional groups present on the heterocyclic compound. For instance, the 
carboxyl group on a heterocyclic compound is reacted with the amino group on the 
peptide to provide an amide linkage. 
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[0077] Preferably, the GRIP assay of the present invention is used to screen peptide 
libraries. A comprehensive review of various types of peptide libraries can be found in 
Gallop et al. 9 1994, J. Med. Chem. 37:1233-1251. The use of peptide libraries is well 
known in the art. Peptide libraries have generally been constructed by one of two 
5 approaches. 

[0078] In the first approach, peptides have been chemically synthesized in vitro in 
several formats. For example, Fodor et al (1991, Science 251, 767) describe use of 
complex instrumentation, photochemistry and computerized inventory control to 
jfj synthesize a known array of short peptides on an individual microscopic slide. Houghten 

!™ 10 et al. (1991, Nature, 354, 84) describe mixtures of free hexapeptides in which the first 

\il and second residues in each peptide were individually and specifically defined. Lam et 

fo 

.q aL (1991, Nature 354, 82) describe a "one bead, one peptide" approach in which a solid 

phase split synthesis scheme produced a library of peptides in which each bead in the 

□ collection had immobilized thereon a single, random sequence of amino acid residues. 

in 

1*2 15 [0079] In the second approach, peptides are expressed in biological systems as either 
=F soluble fusion proteins or viral capsid fusion proteins. A number of peptide libraries have 

\1 been generated using the Ml 3 phage. Ml 3 is a filamentous bacteriophage that has been 

routinely used in molecular biology laboratories for the past 20 years. M13 viral particles 
consist of six different capsid proteins and one copy of the viral genome, as a single- 
20 stranded circular DNA molecule. Once the M13 DNA has been introduced into a host 
cell such as E. coli, it is converted into double-stranded, circular DNA. The viral DNA 
carries a second origin of replication that is used to generate the single-stranded DNA 
found in the viral particles. During viral morphogenesis, there is an ordered assembly of 
the single-stranded DNA and the viral proteins, and the viral particles are extruded from 
25 cells in a process much like secretion. The M13 virus is neither lysogenic nor lytic like 
other bacteriophage {e.g., X)\ cells, once infected, chronically release virus. This feature 
leads to high titers of virus in infected cultures, i.e., 10 12 pfii/ml. 
[0080] In a preferred embodiment, a GFP peptide comprising a fragment of GFP is 
fused to a random peptide to form a fusion polypeptide. By "fused" or "operably linked" 
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herein is meant that the random peptide and the GFP, are linked together, in such a 
manner as to minimize the disruption to the stability of the GFP structure, i.e. it retains 
fluorescence. The GFP fusion polypeptide of the present invention can comprise further 
components such as linkers or fusion partners. 
5 [0081] The peptides (and nucleic acids encoding them) are randomized, either fully 

randomized or they are biased in their randomization, e.g. in nucleotide/residue frequency 
generally or per position. As used herein "fully randomized" means that each nucleic 
acid and peptide consists of essentially random nucleotides and amino acids, respectively. 
The nucleic acids which give rise to the peptides are chemically synthesized, and thus 
10 may incorporate any nucleotide at any position. Thus, when the nucleic acids are 

expressed to form peptides, any amino acid residue may, be incorporated at any position. 
The synthetic process can be designed to generate randomized nucleic acids, to allow the 
formation of all or most of the possible combinations over the length of the nucleic acid, 
thus forming a library of randomized nucleic acids. 
15 [0082] Alternatively, the peptide library is biased. In this case, some positions within 
the sequence are either held constant, or are selected from a limited number of 
possibilities. Individual residues may be fixed in the random peptide sequence to create a 
structural bias. For example, proline or bulky residues such as W, R, K, L, I, V, F or Y 
may be inserted to restrict the conformation of the peptide. Also, the library can be 
20 biased to a particular secondary structure such as the alpha-helical structure. Examples of 
helix forming residues include M, A, K, L, D, E, R, Q, F, I, and V. 
[0083] In a preferred embodiment, the bias istoward peptides that interact with the 
known classes of molecules. For example/it is known that SH-3 peptides binds to SH-3 
proteins. A large number of small molecule domains are known that are suitable as 
25 starting points for the generation of biased randomized peptides. Examples of such 
molecules, domains, or consensusysequences include, but are not limited to SH-2 
domains, SH-3 domains, pleckstnn, death domains, protease cleavage/recognition sites, 
enzyme inhibitors, enzyme substrates, and Traf, and leucine zipper consensus sequence. 
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[0084] As discussed above, a fusion partner or linker can be added to fuse the random 
peptides to a GFP peptide. Fusion partners or linkers can be synthetic or heterologous 
(not native to the host cell). Appropriate fusion partners include, but are not limited to 
peptides that are stability sequences that stabilize and protect the random peptide from 
5 degradation, linker sequences for decoupling the random peptide from the GFP fragment, 
structural sequences that restrict and stabilize the conformation of the random peptide, 
targeting sequences which allow localization of the peptide into a subcellular or 
extracellular compartment, and rescue sequences that allow the purification or isolation of 
iQ the random peptide. 

: ^ 10 [0085] In light of the foregoing general discussion, the specific examples presented 

iy below are illustrative only and are not intended to limit the scope of the invention. Other 

in 

generic and specific configurations will be apparent to those persons skilled in the art. 

□ EXAMPLES 

i n 

15 General Materials and Methods 

*h [0086] GFP Variant: A variant of the naturally occurring GFP, which has a single 

: : 

excitation maximum at 475 nm was chosen for dissection and reassembly. The GFP 
variant (sglOO) contains F64L, S65C, Q80R, Y151L, I167T and K238N mutations from 
wild type GFP, which leads to a single fluorescence excitation and emission maximum at 
20 475 nm and 505 nm respectively, similar to GFP-sg25 as described by Palm, G. J et al, 
1997, Nat. Struct. Biol., 4, 361-365. 

[0087] Cloning and Purification Protocol for NZGFP y NGFP, CGFP, and CZGFP: 
The NZGFP, NGFP, CGFP and CZGFP coding DNA were obtained by PCR 
25 amplification of the GFP (sglOO) plasmid template using appropriate primers. The DNA 
fragments were cut with Nhel/BamHl and ligated into the pETl la vector. The DNA 
sequences of the NZGFP, NGFP, CGFP, and CZGFP containing clones were verified by 
dideoxyoligonucleotide sequencing at the Keck facility at Yale. The protein products 
were overexpressed in BL21(DE3) cells at 37 °C without IPTG induction. The cells were 
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lysed by sonication and the proteins were individually purified by passage over 2 
successive Q-sepharose columns and then over a Gel-filtration column. Fractions 
containing the protein of interest, as determined by 'SDS-PAGE, were pooled and 
dialyzed against 2mM DTT, 10 mM TrisHCl buffer at pH 7.2. Final purified yields of 
5 proteins were between 10-20 mg/ L. Protein molecular weights were verified by MALDI 
mass spectrometry to within 0.05% of the calculated molecular weight. Amino acid 
analysis of the proteins established the correct compositions and protein concentrations 
for further biophysical studies. 

10 [00881 Amino acid sequences ofNGFP, NZGFP, CGFP, and CZGFP: Leucine zippers 
are in bold and linker regions underlined. Note the 6 residue linker between the C- 
terminal ofNGFP and NZ and the 4 residue linker between CGFP and CZ. 

NGFP 

15 [0089] MASKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFIC 
TTGKXPVPWPTLVTTLCYGVQCFSRYPDHMKRHDFFKSAMPEGYVQERTIFFKD 
DGNYKTRAEVKFEGDTLVNRIELKGIDFK^DGMLGHKLEYNYNHNVLDvIADKQ 
GGSGSG (SEP ID NO: 3) 

20 NZGFP 

[0090] MASKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFIC 
TTGKLPVPWPTLVTTLCYGVQCFSRYPDHMKRHDFFKSAMPEGYVQERTIFFKD 
DGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGMLGHKLEYNYNHNVLIMADKQ 
GG5G5GALKKELOANKKELAQLKWELQALKKELAQ (SEQ ID NO: 4) 

25 

CGFP 

[0091] MAS£zGjS<2KNGIKVNFKTRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNH 
YLSTQSALSKDPNEKRDHMVLLEFVTAAGlf HGMDELYN (SEQ ID NO: 5) 
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[0092] CZGFP 

MASEQLEKKLQALEKKLAQLEWKN 

EDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVT 
AAGITHGMDELYN (SEQ ID NO: 6) 

[0093] Additional Constructs for GFP Reassembly: DNA constructs for EE-NGFP and 
KK-CZGFP coding DNA were obtained by PCR amplification of the GFP (sglOO) 
plasmid template using appropriate primers encoding the leucine zippers KK and EE 
whose sequences are AQLKEKLQALKEKLAQK WKLNALKEKLAQ (SEQ ID NO: 7) 
and ALEKELQANEKELAQLEWELQALEKELAQ (SEQ ID NO: 8) respectively. The 
DNA fragments were digested with Nhel/BamHl (New England Biolabs) and ligated into 
the pETl la vector. The DNA sequences of the EE-NGFP, and KK-CGFP containing 
clones were verified by automated sequencing at the Keck facility at Yale. 

[Q094] Constructs for Library Selection: For leucine^per library construction two 
overlapping degenerate oligonucleotides containing'NAG (N= G or A) at all positions 
Corresponding to Lys in the leucine zipper of KK-CGFP were synthesized such that they 
r would code for either Lys or Glu with equal^probability. The two overlapping 

oligonucleotides were mutually primecLrind extended using T7 Sequenase (Amersham) 
with 10 mM dNTPs. The product vms purified from an agarose gel and subsequently 
ligated into the Nhel-DraHI (Nejv England Biolabs) cassette present in a previously cut 
KK-CGFP plasmid. The resulting library, XX-CGFP, was transformed in 5 x 50 \iL of 
electrocompetent XLl-Bhfe cells (Stratagene) and selected for ampicillin resistance. The 
resulting pool of XX-CGFP plasmids was sequenced to verify that G/A were equally 
represented at sites jzff randomization. 

[0095] Colorometric Selection: For all re^sembly experiments with NZGFP/CZGFP, 
NZGFP/KK-CGFP, EE-NGFP/CZGFPyfad EE-NGFP/KK-CGFP: 1 jag of each plasmid 
was cotransformed in 30 |iL of BL2^(DE3) cells and selected on ampicillin containing 
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LB plates. The plates were incubated at 37 °C overnight and subsequently moved to the 
bench top (23 °C) for 2 day. The green color developed after lp-32 hours. The 
cotransformation efficiency was approximately 7 ± 2 % as verified by growing up 
individual colonies and monitoring protein expression profiles, which corresponded well 
5 with visual inspection of green colonies in experiments with NZGFP/CZGFP and EE- 
NGFP/KK-NGFP. Non-fluorescent colonies that coexpressed either NZGFP/KK-CGFP 
or EE-NGFP/CZGFP were identified by screening L20 colonies of respective 
= cotransformations by SDS gel for protein expression of both gene products. In library 

*0 selections, 20 individual cotransformations of Ljlg of XX-CGFP library plasmid with 1 

in 10 \xg of EE-NGFP plasmid were carried out as described above. Sixteen colonies were 
;j? selected from 102 green colonies of ~4000/iotal colonies. The colonies were grown 

*D overnight in LB media and the plasmid yNA (XX-CGFP + EE-NGFP) purified and 

sequenced using primers unique to the/XX-CGFP construct, 
y Inhibition of Protein-Protein Interactions: The protein products for NZGFP, CZGFP, 

1=5: 15 NGFP and CGFP were overexpres^ed in BL2 1 (DE3) cells at 37 °C and purified as 
jE described above. Amino acid analysis of the proteins established the correct compositions 

1^ and protein concentrations for fluorescence experiments. The inhibitor peptide 

corresponding to the leucine Zipper of NZGFP (EK peptide) having the sequence 
ALKKELQANKK^LAQLKWELQALKKELAQ (SEQ ID NO: 1) was synthesized at the 
20 Keck facility (Y ale University) and purified on a reverse phase C8 column(Vydac) by 

HPLC. Peptide concentrations were determined by Tip absorbance and verified by amino 
acid analysis. / 

[0096] For inhibition experiments all fluorescence measurements were made in 
triplicate on a Hitachi F-4500 Fluorescence Spectrophotometer with excitation at 475 nm 
25 and emission at 505 nm. A 1.2 mM stock solution of equimolar amounts of 

NZGFP/CZGFP was allowed to reassemble and fluoresce until there was no change in 
fluorescence (36 hours). The reassembled complex was denatured in 4 M GdmHCl for 4 
hours following which different concentrations of EK peptide, NGFP, or CGFP were 
added and the NZGFP/CZGFP concentration adjusted to 800 jaM. The samples were 
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diluted 200 fold into 2 mM DTT, 10 mM Tris.HCl buffer at pH 7.2 to a 20 mM final 
concentration of Gdm.HCl and allowed to refold and fluoresce. In order to eliminate 
artifacts from time dependent inhibition, fluorescence measurements were made after 4 
hours and after 16 hours and found to be constant. 

Example 1: In Vitro Reassembly of GFP Using an Antiparallel Leucine 

Zipper 

1 . Design of Antiparallel Leucine Zipper 
[0097] Designs for helices, designated NZ and CZ, to form antiparallel leucine zippers 
for reassembly purposes were based upon sequences reported by Hodges, (11a) Kim, 
(Oshea, E. K. et al., 1993, Current Biol., 3, 658-667) and Alber (Harbury, P.B.et al., 
1994, Nature, 371, 80-84). The leucine zippers contained a Leu-rich hydrophobic core, 
acidic (Glu) and basic (Lys) residues to direct antiparallel heterodimer formation, and 
also incorporated a buried asparagine residue which disfavors homodimerization by up to 
2.3 Kcal/mol (Figure 1) (Oakley M. G. et al; 1998, Biochemistry, 37, 12603-12610). 

2. Dissection of GFP / 
^098] The variant GFP (sglOO) wa/dissected at a surface loop between residues 157 
and 158, a position that has previgfusly been shown to accommodate a 20 residue amino 
acid insertion ( Abedi - M R ,- e ; al/\99%, Nm * aeAl *>^^ 2 ^ 3m ' The dissection resulted in N- 
and C-terminal fragments^designated NGFP and CGFP, containing 157 and 81 residues, 
respectively (Figure 1). /The NGFP fragment contains the three residues, Ser65, Tyr66, 
and Gly67, that ultimately form the GFP fluorophdre (Tsien, R. Y., 1998, Annu. Rev. 
Biochem., 67, 509-/44). 

3 Tn Vitro Reassembly of th e Pi ssected GFP Fragments Using the Designed 

Helices 

[0099] The designed helix, NZ was appended to the C-terminal of NGFP, via a six 
residue linker, to generate the fusion peptide designated NZGFP. Similarly, CZ was 
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appended to the N-terminal residue of CGFP, via a four residue linker, to generate the 
complementary fusion peptide, CZGFP. 

[0100] It was envisioned that if NZGFP and CZGFP were competent to heterodimerize 
via the designed helices, either in vitro or in vivo, the reconstituted GFP protein would 
5 display its characteristic fluorescence, indicating the correct reassembly of the tertiary 
fold from the peptide fragments. The genes encoding the designed protein sequences 
NZGFP, CZGFP, NGFP, and CGFP were cloned and the resulting proteins overexpressed 
and purified using methods routinely practiced by the skilled artisan. 
yd [0101] To investigate the viability of the protein reassembly strategy, a literature 

|B 10 protocol devised for the refolding of denatured GFP was followed (Reid, B. G. et al, 

lij 1997, Biochemistry, 36, 6786-6791). Thus, equimolar amounts (4 |J.M) of the fragments, 

- '-~ 

% NZGFP and CZGFP, were denatured in 6 M GdrhHCl and dialyzed into a buffer 

k% i containing 2 mM DTT, 10 mM phosphate buffer at pH 7.2 over 24 hrs at 4 °C. The 

P reassembled peptides were visibly green. Moreover the A max for the fluorescence 

1^ 15 excitation and emission spectra were identical to that of the parent GFP (Figure 2 inset). 
;F To verify that the reassembly was indeed guided by the antiparallel leucine zippers, 

\^ control experiments were done with fragments with and without the leucine zippers. It 

was found that solutions containing NGFP, CGFP, NGFP/CGFP, NZGFP/CGFP, or 
NGFP/CZGFP did not fluoresce, even' at concentrations of over 100 |J.M. The apparent 
20 dissociation constant, Kd apP , for the NZGFP/CZGFP complex was determined by titrating 
NZGFP into a solution of CZGFP and monitoring the fluorescence emission intensity at 
505 nm (Figure 2). The data were fitted to a two-state binding isotherm, yielding a K<japp 
of 31 ± 7 nM and a-analysis of the binding data verified the expected 1:1 stoichiometry 
of NZGFP and CZGFP (Bagshaw, C R.; et al\ 1987, Spectrophotometry and 
25 spectrofluorimetry: A practical approach, pp 91-1 13). 

Example 2: In Vivo Reassembly of the Dissected GFP Fragments 
[0102] BL21(DE3) E. coli cells were transformed with equimolar amounts of NZGFP 
and CZGFP encoding plasmids. The appearance of green color was monitored to identify 
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cotransformed colonies expressing reassembled GFP. After 36 hours several of the 
colonies turned green as illustrated in Figure 3 a. with a cotransformation efficiency of 
4%. Individual colonies were cultured in liquid media and their protein expression 
pattern analyzed. The green colonies were shown to express similar amounts of NZGFP 
5 and CZGFP (Figure 3b and 3c), whereas non-fluorescent colonies were shown to contain 
either NZGFP or CZGFP. Furthermore, control cotransformation experiments with 
NGFP/CGFP, NGFP/CZGFP and NZGFP/CGFP failed to show any green colonies, thus 
emphasizing the requirement for the presence of both NZ and CZ leucine zippers to 
mediate GFP assembly in vivo and in vitro, 

10 

Example 3: Reassembly of Proteins Using Antiparallel Leucine Zipper 
[0103] The methods described above for reassembly of GFP in vivo and in vitro may be 
modified for reassembly of any protein of interest, using antiparallel leucine zippers. The 
particular proteins are not critical, so long as they can be divided into fragments that 
15 produce a detectable signal upon their association, specific binding, or complexation 
mediated by the formation of an antiparallel zipper with a known biological activity or 
function that can be assayed for in vitro or in vivo, for example, kinase activity for a 
protein kinase, proteolytic activity for a protease, and DNA binding activity of DNA 
binding protein. 

20 [0104] The peptide fragments of the protein of interest are fused to each of the helices 
(SEQ ID NO: 1 and SEQ ID NO: 2) described above. Alternatively, other pairs of helices 
that form antiparallel leucine zippers may be designed and fused to the peptide fragments 
of the protein of interest. 

25 1. In Vitro Reassembly 

[0105] Equimolar amounts of the fusion peptides comprising peptides of the protein of 
interest and helices that form antiparallel leucine zippers are denatured and dialyzed as 
described in Example 1. The reconstitution of the protein is monitored. 
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2. In Vivo Reassembly 
[0106] Equimolar amounts of plasmids encoding the fusion peptides are transformed in 
host eucaryotic or procaryotic host cells as described in Example 2. The cotransformed 
colonies expressing reassembled protein are identified. 

Example 4: The GRIP Assay 
[0107] In order to test the specificity, of the GRIP assay, a set of four possible LZ 
combinations which were either electrostatically matched (EE-NGFP/KK-CGFP and 
NZGFP/CZGFP) or mismatched (EE-NGFP/CZGFP and NZGFP/KK-CGFP) (Fig. 4; 
Bryson, et al, 1995, Science 270, 935-941, Oakley et al, 1998, Biochemistry 37, 12603- 
12610) were designed. The GRIP assay would only allow the matched pairs to 
competently fold and catalyze fluorophore formation in GFP. 

[0108] Escherichia coli (BL21) cells were cotransformed with plasmids encoding the 
proteins of interest and plated on ampicillin containing plates. Fluorescent colonies were 
observed only in the complementary pairs (EE-NGFP/KK-CGFP and NZGFP/CZGFP). 
No visible fluorescence was observed in colonies containing the uncomplementary pairs 
(EE-NZGFP/CZGFP and NZGFP/KK-CGFP). Since the electrostatically mismatched 
pairs have a dissociation constant, K<i, of ~ 100 \\M (Yao, S., et al, 1998, Nature 396, 
447-450), this experiment sets an initial lower visual limit for detecting protein-protein 
interactions using the GRIP assay. 

Example 5: GRIP Assay and Combinatorial Selection 
[0109] A library of LZ partners that could either code for Glu or Lys with equal 
probability at the e and g "specificity" positions (Fig. 5 A) was generated (Oakley, M. G. 
25 et al, 1998, Biochemistry 37, 12603-12610; Pelletier, J.N. et al, 1999, Nat. BiotechnoL 
17, 683-690; Dmitry, K., et al, 1994, EMBOJ. 13, 2849-2861). This library was fused to 
the C-terminal GFP fragment (XX-CGFP). It is thought that the selected partners would 
be enriched in Lys in order to complement the acidic EE-NGFP. 
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[0110] The 256-member plasmid-encoded library of XX-CGFP was cotransformed with 
EE-NGFP and selected colonies that exhibited fluorescence. The protein expression 
profiles of the two protein fragments, XX-CGP and EE-NGFP, were virtually identical in 
cotransformed cells (Fig. 5B), thus excluding differences in relative protein concentration 
5 as a major determinant of the observed fluorescence. Sixteen of the multiple colonies 
exhibiting fluorescence were sequenced. The results of the selection are summarized in 
Figure 5C. The selected LZ partners of EE-NGFP displayed an overall 3:1 ratio of 
Lys:Glu residues, with the fewest Lys residues being 5 and the most being 7. Thus, an 
overall enrichment of Lys residues was observed as predicted from the requirement for 

10 electrostatic pairing of Lys/Glu for stabilizing the leucine zipper. Assuming an average 
value of 0.85 kcal/mol penalty for each Glu-Glu pair relative to Lys-Glu pair based on 
literature precedence, (Dmitry, K., et aL, 1994, EMBO J. 13, 2849-2861; Zhou, N. E. et 
al, 1994, Protein Eng. 7, 1365-1372), a 75 fold difference in Ka between the best with all 
Lys (Kd = 33 nM) (Ghosh, I. et al 9 2000, J, Am. Chem. Soc. 122, 5658-5659) and worst 

15 (5 Lys and 3 Glu) leucine zipper partners for EE-NGFP was estimated. Thus, this 

experiment sets a lower threshold for the visual detection of interacting proteins in the 
GRIP assay to - 2.5 |oM, which is within the observed dissociation constant for most 
specific protein partners. 

20 Example 6: Inhibition of Protein-Protein Interactions 

[0111] To verify that the GRIP assay could be utilized for detecting inhibitors of 
protein-protein interactions (Zutshi, R., et aL, 1998, Curr. Opin. Chem. Biol 2, 62-66), a 
LZ peptide corresponding to the LZ present in NZGFP (Fig. 4) was synthesized. The 
reappearance of fluorescence of the disassembled NZGFP/CZGFP complex in an in vitro 

25 assay as a function of added peptide was monitored (Fig. 6). The EK peptide (SEQ ID 
NO: 1) prevented the assembly of the complex (4 fiM) with an IC50 value of 3 1 jiM. 
Control experiments with addition of either NGFP or CGFP fragments that lacked leucine 
zippers did not prevent reassembly of NZGFP/CZGFP complex (Fig. 6). It is worth 
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noting that disassembly of an existing GFP complex was not achievable even at > 1 mM 
added peptide inhibitor. 

Example 7: GRIP Assay and Identification of Binding Partners Via 
5 Combinatorial Selection 

[01 12] The GRIP assay may be modified by substituting the helices that form 
antiparallel leucine zippers with test proteins or peptides to determine whether a test 
protein or peptide attached to one portion of GFP interacts with another test protein or 

iP peptide attached to the other portion of GFP. The test proteins can be any protein. As an 

]% 10 example, an orphan receptor can be fused to one portion of the GFP, while test ligands 

I J can be fused to the second portion of GFP. 

[0113] Specifically, nucleic acid encoding a fusion protein comprising an orphan 

^ receptor and a first portion of the GFP and a plasmid library of fusion proteins comprising 

!3 test ligands and the second portion . of GFP can be cotransfected or cotransformed into 

W 

t]2 15 host cells. Colonies exhibiting fluorescence are selected, since they contain GFP 

= F molecules that have been properly folded or reassembled, and test ligands that interact 

13 .... 

with the orphan receptor. The colonies can be further cultured and investigated to 

determine the structural properties of the ligand. The molecular weight of the ligand may 

be determined by SDS-PAGE, and the primary structure may be determined by amino 

20 acid sequencing. 

[0114] Examples of orphan receptor groups include but are not limited to CCRL2, 
CMKLR1, CMKRL2, GPR31, HM74, and RDCL Specific examples of orphan receptors 
of each group include but are not limited to: 1) CCRL2: chemokine (C-C motif) 
receptor-like 2, HCR, CRAM-B, CKRX, CRAM-A, lipopolysaccharide inducible C-C 

25 chemokine receptor related, E01; 2) CMKLR1 : chemokine-like receptor 1, ChemR23, 
CMKRL3, DEZ, CMKLR1, LOC60669: G-proteih coupled chemoattractant-like 
receptor; 3) CMKRL2: chemokine receptor-like 2, CMKRL2, FEG-1, GPCR-BR, 
DRY 1 2, CEPR, GPR30, GPR4 1 ; 4) GPR3 1 : G protein-coupled receptor 3 1 , GPR3 1 , 
Gpr31b; 5) HM74: putative chemokine receptor, GTP-binding protein; and 6) RDC1 : 
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chemokine orphan receptor, D2S87E, GPRN1, CMKOR1, canine orphan receptor RDC1 
homolog, chemokine orphan receptor 1, Rdcl. 

[0115] The GRIP assay may be modified to detect macromolecular interactions, for 
example, specific protein-protein interactions, both in vitro and in vivo. When the 
proteins attached to the two GFP fragments associate with each other, the two GFP 
fragments will properly reassemble and fluoresce. Thus, in the absence of association, 
the proteins attached to the GFP fragments do not fluoresce. Fluorescence of interacting 
protein pairs linked to NGFP and CGFP can provide a sensitive assay for detecting the 
affinity and specificity of the individual protein pairs (and their mutants) under 
investigation. 

[0116] Examples of protein-protein interactions include, but are not limited to, 
antigen/antibody, ligand/receptor, antagonist or inhibitor/protein, binding protein/protein, 
and enzyme/substrate. Specific protein-protein interactions involved in disease and 
identified as potential drug targets include examples such as Bax/Bcl-2 (Sartorius, et al. 9 
2001, Chembiochem, 2 (1), 20), p53/mdm2 (Moll et al, 2000, Drug Resist Update, 3 (4), 
217)), VEGF/VEGF-R (Plate et al, 1992, Nature, 359, 845), IL-6/IL-6R (Akira et al, 
1993, Adv. Immunol, 54, 1), Ras/Raf (Weinstein-Oppenheimer et aL, 2000, Pharmacol 
sTher., 88(3), 229). 

ml 17] Examples of otWmacromolecular interactions include, but are not limited to, 
nucleic acid-nucleic a(^id binding protein interactions and carbohydrate-protein 
interactions, / 

Example 8: GRIP Assay and Identification of Inhibitors Via Combinatorial 
Selection 

[0118] The GRIP assay may be modified to identify inhibitors of a specific protein- 
protein interaction. For example, a receptor can be fused to a portion of GFP, while a 
ligand can be fused to a second portion of GFP. A test inhibitor, such as a test antagonist, 
can be incubated with the two GFP fusion proteins comprising the ligand and receptor to 
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see if it prevents the reassembly of GFP which can be detected by the loss of 
fluorescence. 

[0119] Specifically, nucleic acid encoding a fusion protein comprising a known receptor 
and a first portion of the GFP, nucleic acid encoding a fusion protein comprising its 

5 known ligand and the second portion of GFP, and a plasmid library of test antagonists can 
be cotransfected or cotransformed into host cells. Colonies that do not exhibit 
fluorescence are selected, since they contain GFP molecules that have been prevented 
from folding or reassembly and test antagonists that inhibit the interaction of the known 
receptor with its ligand. The colonies can be further cultured and investigated to 

10 determine the structural properties of the ligand. The molecular weight of the ligand may 
be determined by SDS-PAGE, and the primary structure may be determined by amino 



present a detailed description of certain preferred embodiments. It therefore should be 



15 apparent to those of ordinary skill in the art that various modifications and equivalents 
can be made without departing from the spirit and scope of the invention. All journal 
articles, other references, patents, arid patent applications that are identified in this patent 
application are incorporated by reference in their entirety. 



acid sequencing. 



[0120] It should be understood that the foregoing discussion and examples merely 
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